Auto-generation of Class Diagram from Free-text Functional Specifications and Domain Ontology
نویسندگان
چکیده
In this paper, we propose and implement a system that automates the building of class diagram from free-text requirement documents. Our approach first applies natural language processing (NLP) techniques to understanding of the written requirements, and then uses domain knowledge represented by domain ontology to improve the performance of class identification. The basic notion of our approach is that, a class especially core class of the domain is always semantically connected with other classes and its attributes. Based upon this, our system applies a spider model to search classes of interest. It first identifies core classes of the domain as a starting point, with which we are usually highly confident, and further finds classes that are semantically related with the identified ones. The strength of this method is that it identifies classes and relationships at one step. Specifically, our approach aims at addressing the following challenging research questions: (1) How to use domain ontology but not limited to domain ontology? (2) How to find candidate concepts? (3) How to find concept pairs with strong semantic connection in the context? (4) How to distinguish attribute name from class name for each concept? (5) How to name each inter-class relationship, aggregation, generalization, or association? (6) Are there any missing classes or attributes? This methodology finds its way through NLP, which is proved to be effective and sufficient to provide solutions to these problems in this paper.
منابع مشابه
The Role of Ontologies in Schema-based Program Synthesis
Program synthesis is the process of automatically deriving executable code from (non-executable) high-level specifications. It is more flexible and powerful than conventional code generation techniques that simply translate algorithmic specifications into lower-level code or only create code skeletons from structural specifications (such as UML class diagrams). Key to building a successful synt...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملINDUCING VALUABLE RULES FROM IMBALANCED DATA: THE CASE OF AN IRANIAN BANK EXPORT LOANS
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کاملINDUCING VALUABLE RULES FROM IMBALANCED DATA: THE CASE OF AN IRANIAN BANK EXPORT LOANS
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کاملخوشهبندی اسناد مبتنی بر آنتولوژی و رویکرد فازی
Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...
متن کامل